Speech Enhancement Using U-Net with Compressed Sensing
نویسندگان
چکیده
With the development of deep learning, speech enhancement based on neural networks had made a great breakthrough. The methods U-Net structure achieved good denoising performance. However, part them rely ordinary convolution operation, which may ignore contextual information and detailed features input speech. To solve this issue, many studies have improved model performance by adding additional network modules, such as attention mechanism, long short-term memory (LSTM), etc. In work, therefore, time-domain combines lightweight Shuffle Attention mechanism compressed sensing loss (CS loss) is proposed. dilated residual blocks are constructed used for down-sampling up-sampling in model. added to final output encoder focusing suppressing irrelevant audio information. A new defined using measurements clean enhanced sensing, it can further remove noise noisy experimental part, influence different functions proved through ablation experiments, effectiveness CS verified. Compared with reference models, proposed obtain higher quality intelligibility scores fewer parameters. When dealing outside dataset, still achieves performance, proves that not only achieve effect, but also has generalization ability.
منابع مشابه
Speech enhancement using compressed sensing
This paper proposes an approach based on the compressed sensing for speech enhancement. It attempts to exploit the fact that it is relatively easy to find sparse representation for speech signal but the same cannot hold true for noise. Thus for a given speech signal a sparse vector is derived which extracts the speech signal from the given noisy speech signal. Approach is further modified to en...
متن کاملSpeech Enhancement Based on Compressed Sensing Technology
Compressed sensing (CS) is a sampled approach on signal sparsity-base, and it can effectively extract the information which is contained in the signal. This paper presents a noisy speech enhancement new method based on CS process. Algorithm uses a voice sparsity in the discrete fast Fourier transform (Fast Fourier transform, FFT), and complex domain observation matrix is designed, and the noisy...
متن کاملFrames for compressed sensing using coherence
We give some new results on sparse signal recovery in the presence of noise, for weighted spaces. Traditionally, were used dictionaries that have the norm equal to 1, but, for random dictionaries this condition is rarely satised. Moreover, we give better estimations then the ones given recently by Cai, Wang and Xu.
متن کاملframes for compressed sensing using coherence
we give some new results on sparse signal recovery in the presence of noise, forweighted spaces. traditionally, were used dictionaries that have the norm equal to 1, but, forrandom dictionaries this condition is rarely satised. moreover, we give better estimationsthen the ones given recently by cai, wang and xu.
متن کاملAn Adaptive Compressed Sensing Method in Speech
The application of an adaptive compressive sensing method in the speech signal processing is proposed in this paper. First, the threshold of wavelet transform is used to preprocess the speech signal. Then, according to the parameters of the speech frame, each frame is adaptively assigned a measurement number. Finally, the measurement matrix is used to reconstruct the speech signal. Experimental...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2022
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app12094161